Deep learning for human action recognition in videos is making significant progress, but is slowed down by its dependency on expensive manual labeling of large video collections. In this work, we investigate the generation of synthetic training data for action recognition, as it has recently shown promising results for a variety of other computer vision tasks. We propose an interpretable parametric generative model of human action videos that relies on procedural generation and other computer graphics techniques of modern game engines. We generate a diverse, realistic, and physically plausible dataset of human action videos, called PHAV for "Procedural Human Action Videos". It contains a total of 39,982 videos, with more than 1,000 examples for each action of 35 categories. Our approach is not limited to existing motion capture sequences, and we procedurally define 14 synthetic actions. We introduce a deep multi-task representation learning architecture to mix synthetic and real videos, even if the action categories differ. Our experiments on the UCF101 and HMDB51 benchmarks suggest that combining our large set of synthetic videos with small real-world datasets can boost recognition performance, significantly outperforming fine-tuning state-of-the-art unsupervised generative models of videos.